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Abstract 

Background: Developing a water quality index which is used to convert the water quality dataset into a single number 
is the most important task of most water quality monitoring programmes. As the water quality index setup is based on 
different local obstacles, it is not feasible to introduce a definite water quality index to reveal the water quality level. 

Findings: In this study, an innovative software application, the Iranian Water Quality Index Software (IWQIS), is 
presented in order to facilitate calculation of a water quality index based on dynamic weight factors, which will help 
users to compute the water quality index in cases where some parameters are missing from the datasets. 

Conclusion: A dataset containing 735 water samples of drinking water quality in different parts of the country was used 
to show the performance of this software using different criteria parameters. The software proved to be an efficient tool 
to facilitate the setup of water quality indices based on flexible use of variables and water quality databases. 
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Introduction 

The Water Quality Index (WQI) is introduced as a 
mathematical instrument to convert the water quality 
dataset into a single number which represents the water 
quality level while eliminating subjective assessments of 
water quality and biases of individual water quality ex- 
perts [1]. Application of water quality indices allows the 
assessment of changes in water quality over time and 
space and also the evaluation of the efficacy of domestic 
policies and international strategies designed to protect 
aquatic resources [2]. Water quality indices are also used 
for the classification of water [3]. 

Ramakrishnaiah et al [4] presented a groundwater 
WQI which was based on 12 parameters: pH, Total 
Hardness(TH), Ca ++ , Mg ++ , HC0 3 ~, CI", N0 3 ~, S0 4 ~, 
Total Dissolved Dolids (TDS), Fe ++ , Mn ++ , and F". 
According to their presented method, the values of these 
12 parameters should be monitored to calculate the 
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WQL Relative weight factors of the mentioned twelve 
parameters should also be calculated, and there is no 
way to calculate the WQI when parameters included 
in the computation of the index are missing from 
the datasets. In many countries, water monitoring 
programmes are decentralized and different water moni- 
toring sectors include their choices of parameters in rou- 
tine periodic sampling and analysis. Therefore, the use 
of water quality indices which are based on fixed param- 
eters overlooks large data records during the process of 
computing the WQI, especially when the index is not 
defined according to available data in the database. In 
many areas, especially those with extensive use of agro- 
chemicals, it is necessary to consider pesticides as 
health-risk-based parameters. Furthermore, in industria- 
lized areas with high levels of potentially harmful an- 
thropogenic pollutants, the role of organic solvents such 
as carbon tetrachloride, trichloroethylene, and per- 
chloroethylene as potential criteria pollutants should not 
be overlooked; otherwise, some particular water sources 
may receive good scores and yet have water quality im- 
paired by parameters not included in the index. 



o 
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According to the above mentioned cases, it is not 
practical to set up a WQI with definite criteria pollu- 
tants which could be effectively used in all cases. There- 
fore, software is needed to enable water quality experts 
to set up their own water quality indices. Furthermore, 
facilities should be presented for the efficient use of 
parameters in water quality datasets which contain miss- 
ing values. In this study, a software named as the Iranian 
Water Quality Index Software (IWQIS) was developed to 
address these issues. 

Materials and methods 

Water quality index background 

Two indices were calculated in 1988; the degree of con- 
tamination for health-risk -based parameters (F~,N03~, 
U02 2 ", As, B, Ba, Cd, Cr, Ni, Pb, Rn, and Se), and the de- 
gree of contamination for technical-aesthetic parameters 
(pH, KMn04 consumption, S0 4 ", CI", Ag, Al, Cu, Fe, 
Mn, Na, and Zn) [5]. In another study, nine variables were 
considered: nitrate, phosphate, chloride, TDS, biological 
oxygen demand, cadmium, chromium, nickel, and lead [6]. 

Stigter et al. [7] created a groundwater quality index 
(GWQI) with a method based on multivariate analysis for 
monitoring the influence of agriculture using parameters 
of groundwater chemistry and potability and tested its ap- 
plicability in the south of Portugal. They included nitrate, 
sulphate, chloride, and calcium in their presented index. A 
groundwater quality index (GWQI) was also developed to 
assess water quality affected by a landfill site based on 
seven variables [8]. In this study, creation of the index 
was based on Principal Component Analysis (PCA) and 
benchmarking analysis. They showed that seven variables, 
electric conductivity, TDS, salinity, nitrate, chemical oxy- 
gen demand, and iron, could be used as indicators. Simoes 
et al. [9] proposed a Water Quality Index for management 
purposes in the Medio Paranapanema Watershed in Sao 
Paulo State, Brazil, as a pollution indicator for aquaculture 
activity based on three parameters: turbidity, total phos- 
phorus, and dissolved oxygen. They showed that the water 
quality degradation in the studied area due to aquaculture 
activity could be described with this simple index. 

The groundwater quality in Sunamganj, Bangladesh, was 
studied based on different indices for irrigation and drinking 
uses. Parameters such as absorption ratio, soluble sodium 
percentage, residual sodium carbonate, electrical conduct- 
ance, magnesium adsorption ratio, Kelly s ratio, total hard- 
ness, permeability index, and residual sodium bi-carbonate 
were included to investigate the ionic toxicity [10]. 

Terrado et al. [11] selected the WQI of the Canadian 
Council of Ministers of the Environment (CCME WQI) 
as the most suitable index. It gives a number between 0 
(worst quality) and 100 (best quality). They also performed 
a sensitivity analysis for the CCME WQI to select the best 
procedure for optimizing the WQI according to input 



data. Sharma and Patel [12] collected various seasonal 
groundwater samples for some consecutive years and the 
respective physiochemical analysis was carried out for five 
groundwater quality parameters (pH, TDS, chlorides, 
hardness, and electrical conductivity) which are essentially 
responsible for groundwater quality degradation in the 
studied area. They indicated that the groundwater of the 
study area needs to achieve a considerable degree of qua- 
lity improvement by the most feasible approach such as 
artificial groundwater recharging. Yidana et al. [13] deve- 
loped a groundwater classification scheme using a robust 
WQI modified for the case of the Keta basin and classified 
groundwater in their study area into 'good; 'fair! and mar- 
ginal' water types using ordinary kriging developed from a 
well fitted linear semivariogram function. Recently, a glo- 
bal, country-level Water Quality Index (WATQI) was de- 
veloped as a research and policy-making tool for the 
measurement and management of freshwater quality based 
on data from the UNEP GEMS/Water programme and 
the European Environment Agency (EEA) [14]. 

Omo-Irabor et al. [15] subjected the chemical data set 
to PCA/FA, and Hierarchic Cluster Analysis (HCA). The 
aim of this study was to determine the nature and 
spatial distribution of chemical pollutants in surface and 
groundwater resources in the western Niger Delta re- 
gion. Yidana et al. [16] used the multivariate method to 
analyse surface water hydrochemical data from different 
locations along the Ankobra Basin, Ghana. They aimed 
to extract principal factors related to different sources of 
variation in the hydrochemistry, and therefore they com- 
bined PCA and CA to classify water samples into specific 
groups on the basis of hydrochemical characteristics. 
Banoeng-Yakubo et al. [17] calculated a WQI for samples 
using concentrations of Na + , Ca ++ , Mg ++ , CI", N0 3 ", F", 
and EC at various sample locations. R-mode HCA and 
factor analysis (using varimax rotation and the Kaiser 
Criterion) were used to find the significant sources of 
variation in the hydrochemistry. They classified the WQI 
values into five categories as follows (<50: excellent water; 
50-100: good water; 100-200: poor water; 200-300 very 
poor water; >300: water unsuitable for drinking). Saeedi 
et al. [18] used a WQI to analyse the nature and rate of 
land use change and its associated impact on groundwater 
quality. In this study, a methodology based on multivariate 
analysis was developed to create a GWQI that aimed to 
identify the places with the best quality water for drink- 
ing within the Qazvin province in western central Iran. 
Al-Shami et al. [19] studied the abundance and diversity of 
benthic macroinvertebrates as well as physico-chemical pa- 
rameters in five rivers of the Juru River Basin in northern 
Peninsula Malaysia. The physico-chemical parameters and 
calculated WQI were significantly different among the in- 
vestigated rivers (ANOVA, p < 0.05). They concluded that 
the multivariate analysis (CCA) was highly satisfactory, 
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explaining 43.32% of the variance for the assemblages of 
macroinvertebrates as influenced by 19 physical and 
chemical variables. 

Bu et al. [20] studied the sampled water quality at 12 
sampling sites in the Jinshui River of the South Qinling 
Mountains in China. It was confirmed that 25 studied 
water quality variables had significant temporal diffe- 
rences (p < 0.01) and spatial variability (p < 0.01). Based 
on the similarity of water quality variables and applica- 
tion of cluster analysis, the 12 sampling sites were classi- 
fied into three pollution level groups (no pollution, 
moderate pollution, and high pollution). Razmkhah et al. 
[21] applied PC A and HCA methods to determine the 
water quality of Jajrood River (Iran) and to assess and 
discriminate the relative magnitude of anthropogenic and 
natural influences on the quality of river water. T, EC, pH, 
TDS, NH 4 + , N0 3 ", N0 2 ", Turbidity, Total Hardness, Ca ++ , 
Mg ++ , Na + , K + , CI", S0 4 ", and Si0 2 were selected as the 
physico-chemical variables and total coliform and faecal 
coliform as the biochemical variables to be analysed in 
the water samples from 18 sampling stations. 

In another study, parameters such as dissolved oxygen 
(DO), biochemical oxygen demand (BOD), pH, temperature, 
TDS, turbidity, faecal coliform, heterotrophic plate count, 
hardness, alkalinity, arsenic, lead, mercury, nickel, cad- 
mium, chromium, total phosphorous, H 2 S, nitrate, and 
fluoride were selected to develop the quality of drinking 
water supplied to dairy cattle based on fuzzy logic using 
trapezoidal membership functions [22]. In our recent 
study, we selected twenty parameters which were in- 
cluded based on their critical importance for the overall 
water quality and their potential impact on human health 
to assess the performance of the proposed index under 
actual conditions. The comparison of the outputs of the 
fuzzy-based proposed index with those of the NSF WQI 
and Canadian Water Quality Index (CWQI) showed simi- 
lar results and were sensitive to changes in the level of 
water quality parameters [23]. 

Water quality index setup 

The structure of variables, weights, mathematical relation- 
ships, and specific features of the GWQI presented in this 
study are described in this section. For different water 
quality indices, various variables may be selected according 
to the importance of the parameters and availability of 
data. In this study, we developed software which enables 
users to choose different parameters according to the de- 
sired criteria pollutants. In the software, the user can select 
up to 40 variables which are supposed to be responsible for 
water contamination based on the importance of the vari- 
ables, the availability of data, and experts' professional 
judgements. The most frequently used variables in other 
studies which are used in water monitoring programmes 



and in our national monitoring water activities are set as 
default parameters. 

In this study, we tested the performance of IWQIS on a 
database with 735 water samples from different drinking 
water resources in the country. The selected parameters, 
weights, and limit values which were used to set up the 
WQI for the mentioned dataset are presented in Table 1. 
Ramakrishnaiah et al. [4] selected total hardness, calcium, 
and magnesium in their index. Although the total hardness 
data were available during the setting of the WQI, we did 
not include it in the criteria parameters, since the total hard- 
ness could be calculated by calcium and magnesium and in- 
cluding the calcium, magnesium, and total hardness at the 
same time would cause bias in the computation of the WQI. 
It should be noted that interpretation of the calculated WQI 
was performed according to the classification presented in 
Table 2, which was presented by Sharma and Patel [12]. 

The main concept and incentive for developing the 
IWQIS was to facilitate the computation of WQI with 
more flexibility and to make the calculation of the WQI 
feasible in cases where some data related to selected criteria 
pollutants are missing from the database. It is very common 
to find missing values in some records of water quality da- 
tabases. As mentioned, all the previous water quality indi- 
ces were based on the use of fixed parameters and their 
definite weights. The practical shortcoming of these indices 
appears when one or more parameters are not available in 
a record set. In these cases the other data could not be used 
for calculation of the index, since the weights are fixed and 
cannot be changed. In the method presented in this study, 
the weights are dynamic and in cases where users face a 
lack of data in records of water samples, the new relative 
weights are recalculated according to the available data. 



Table 1 Criteria parameters, weight factors, and limit 
values considered for setting up the water quality index 



Criteria parameters 


Weights 


Limit values(mg/L) 


pH 


4 


6.5-8.5 


Calcium 


2 


300 


Magnesium 


2 


30 


Chloride 


3 


250 


TDS 


4 


1000 


Fluoride 


4 


1.5 


Manganese 


4 


0.1 


Nitrate 


5 


50 


Iron 


4 


0.3 


Sulfate 


4 


250 


Ammonium 


3 


1.5 


Sodium 


3 


200 


Turbidity 


4 


5 
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Table 2 Water quality classification based on WQI values 


Water quality index values 


Interpretation 


<50 


Excellent water quality 


50-100 


Good water quality 


100-200 


Poor water quality 


200-300 


Very poor water quality 


>300 


Unsuitable for drinking 



The concept of dynamic relative risks is illustrated 
in Figure 1 for a single record in an Excel data sheet 
(which can be downloaded from http://tums.ac.ir/ajax 
plorer/data/public/2941eae50882fladcb47436ef78c0el6.php 
?lang=en) to familiarize readers with the presented index 
and the idea of using dynamic weights for computation of 
the WQI. 

As shown in this example, the user has selected 21 
items as criteria parameters. Total Hardness is ex- 
cluded by typing -1 in the column S (indicated by x) 
and 20 parameters and 2,4 D (as user define 1) are in- 
cluded by typing 1 (indicated by V) in the column S. 



No. S Parameter 


Wi 


Si 


Ci 


Practical 
Wi 


Dynamic 
Relative-Wi 


qi = (Ci / Si ) x 100 


Sli=Wi.qi 


1 O pH 


4 


6.5 


8.07 


4 


0.05556 




0.00 


2 Total hardness 


2 


200 


394.8 


0 


0.00000 


197.400 


0.00 


3 © Calcium 


2 


300 


114.8 


2 


0.02778 


38.267 


1.06 


4 (J Magnesium 


2 


30 


6.3 


2 


0.02778 


21.000 


0.58 


5 © Alkalinity 


3 


200 


110 


3 


0.04167 


55.000 


2.29 


6 Chloride 


3 


250 


45.5 


3 


0.04167 


18.200 


0.76 


7 O Total dissolved solid 


4 


1000 


497 


4 


0.05556 


49.700 


2.76 


8 Fluoride 


4 


1.500 


3.000 


4 


0.05556 


200.000 


11.11 


9 {J Manganese 


4 


0.1 


0.05 


4 


0.05556 


50.000 


2.78 


10 O Nitrate 


5 


50 


30 


5 


0.06944 


60.000 


4.17 


11 (^) Iron 


4 


0.300 


1.320 


4 


0.05556 


440.000 


24.44 


12 O Sulphate 


4 


250 


82.9 


4 


0.05556 


33.160 


1.84 


13 Ammonium 


3 






0 


0.00000 


0.000 


0.00 


14 J Aluminum 


3 


0.10 


0.0500 


3 


0.04167 


50.000 


2.08 


15 Copper 


3 


1.0000 


0.5000 


3 


0.04167 


50.000 


2.08 


16 O Zinc 


3 


3.0000 


1.5000 


3 


0.04167 


50.000 


2.08 


17 © Mercury 


4 


0.0060 


0.0030 


4 


0.05556 


50.000 


2.78 


18 O Cadmium 


4 


0.0030 


0.0015 


4 


0.05556 


50.000 


2.78 


19 Q Chromium 


4 


0.0500 


0.0250 


4 


0.05556 


50.000 


2.78 


20 "j Lead 


4 


0.0100 


0.0050 


4 


0.05556 


50.000 


2.78 


21 O Arsenic 


4 


0.0100 


0.0050 


4 


0.05556 


50.000 


2.78 


22 © 2,4 D 


4 


0.0100 


0.0050 


4 


0.05556 


50.000 


2.78 


23 O Userdifine2 








0 


0.00000 


0.000 


0.00 


24 O Userdifine3 








0 


0.00000 


0.000 


0.00 


25 O Userdifine4 








0 


0.00000 


0.000 


0.00 


26 © Userdifine5 








0 


0.00000 


0.000 


0.00 


27 O Userdifine6 








0 


0.00000 


0.000 


0.00 


28 © Userdifine7 








0 


0.00000 


0.000 


0.00 


29 © Userdifine8 








0 


0.00000 


0.000 


0.00 


30 © Userdifine9 








0 


0.00000 


0.000 


0.00 


31 O UserdifinelO 








0 


0.00000 


0.000 


0.00 


32 © Userdifinell 








0 


0.00000 


0.000 


0.00 


33 © Userdifinel2 








0 


0.00000 


0.000 


0.00 


34 O Userdifinel3 








0 


0.00000 


0.000 


0.00 


35 © Userdifinel4 








0 


0.00000 


0.000 


0.00 


36 © Userdifinel5 








0 


0.00000 


0.000 


0.00 


37 © Userdifinel6 








0 


0.00000 


0.000 


0.00 


38 O Userdifinel7 








0 


0.00000 


0.000 


0.00 


39 © Userdifinel8 








0 


0.00000 


0.000 


0.00 


40 O Userdifinel9 








0 


0.00000 


0.000 


0.00 


SUM 


77 






72 


1 


WQI 


74.72 



Figure 1 The concept of dynamic relative risks for a single sample in excel datasheet. 
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Although ammonium was initially selected as a criterion 
criteria parameter, it was excluded from the process in 
this single record because there were no data for this pa- 
rameter. To exclude a parameter for which there is no 
available data, the user can simply type a zero in column 
S (indicated by !). Dynamic relative weights would be 
recalculated according to the selected parameters to 
compute the WQL In this way, the data in a dataset 
which may have some missing values for some parame- 
ters could be effectively used. For pH, the quality value is 
considered zero when the pH is between 6.5 and 8.5. For 



pH values less than 6.5, the quality value is computed 
according to the following formula: 



qvalue pH = (6.5/pH * 100) * Dynamic WeightofpH 



(i) 



and for pH values greater than 8.5, it is calculated 
through the following relationship: 



qvalue pH = (pH/8.5 * 100) * DynamicWeightofpH (2) 



Install IWQIS 



Note: The software is simply installed via running the setup file. It is designed to 
work on computers which have Microsoft Windows XP or higher versions. 



Set a New Database or Load a Previously Built database 



Noted: User should create a database if it has not been created before, and then 
click the "Set database" button to load the built database. 



Define Criteria parameter 



Note: Desired water quality index can be defined by clicking the criteria button. 
Default criteria could be kept or changed by user, parameters, weights, and limit 
values can be defined according to the desired structure of index. 



Data Entry 



Note: User can add /edit the data to the created database. There are also filtering 
capability available in the data entery form. 



Data Processing and Report Generation 



Note: Excel workbook with three worksheets "Original data", "Quality values", and 
"Water quality Index" will be gererated. Software recalculates the dynamic 
relative weights for the records in which one or more parameters have no data. 



Figure 2 Diagram to illustrate how the software works. 
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For the other parameters, quality values are calculated 
according to the following formula. 

Qualityvalue — (Cone, of parameter /Limit Value * 100) 

* Dynamic Weightof parameter (3) 

As previously mentioned, quality scores are deter- 
mined dynamically for parameters which have available 
data in the water quality dataset The water quality 
index, a dimensionless number, is determined as the 
sum of all quality values for those constituents chosen 
by the user as criteria parameters. 



Software specification 

In this study, user-friendly software has been developed 
according to the concept of dynamic weights allocation to 
make the computation of the WQI simple. This package is 
called the Iranian Water Quality Index Software (IWQIS) 
and can be effectively used to process water quality data 
according to the user s choice of parameters, weights, and 
limit values. The authors provide access to the mentioned 
software (via: http://tums.ac.ir/ajaxplorer/data/public/ec21 
f02fcf2f681a89a2f7500c83dle6.php?lang=en) in order to 
simplify the water quality assessment monitoring activities. 

In this section, the software requirements, capabilities, 
and application are described. Figure 2 illustrates how 
the software works. It is simply installed by running the 
setup file which is designed to work on computers with 
Microsoft Windows XP or more recent versions. Users 
should also have Microsoft Excel installed on their com- 
puters, since the reports are designed to be transferred 
in Excel workbooks. Excel reports enable the user to 
perform additional analysis on the output files. 

As shown in Figure 3, when the user clicks on the 
icon, the program starts and the form appears. Using 
this form the user is able to set a new database or load a 
previously built database. 

It should be noted that the user should create a data- 
base if it has not been created before, and then click the 
"Set database" button to load the built database. The cri- 
teria parameters of the desired WQI can be defined by 
clicking the criteria button. Figure 4 shows the facilities 
which are provided for the user to define the criteria pa- 
rameters of the index. In this form, parameters, weights, 
and limit values can be defined according to the desired 



« Table of Main Criteria (Weights and Limits) (Ik 




Parameters 


Weights 


Limits(mg/I) 


Parameters 


Weights 


Limits(mg/I) 




















17 | 


pH 


I 4 


| G.5 fil 


r 


22 


UterDefeedl 








r 2 






| 200 


r 


23 


U S erDefined2 








17 3 


| Calcium 


I 2 


| 300 


r 


24 


User Defined:!: 








[7 4 


| Magnesium 


2 


30 


r 


25 


User Defined4 


I 0 


o 




17 5 


| Alkalinity 


3 


200 


r 


26 










17 6 


| Chloride 


3 




250 


r 


27 


U« Define* 








(7 7 


TDS 


4 




1000 


r 


28 










[7 8 


| Fluoride 


4 


I 1.5 


r 


29 


User Defined:? 


I P 






|7 9 


| Mamganese 


4 


1 0.1 


r 


30 




I ' 


I '- 




17 10 


Nitrate 




I 50 


r 


31 


User DefinedIO 








|7 11 


Iron 


4 


0.3 


r 


32 


User Defined11 


I - 






|7 12 


Sulfate 


4 


250 


r 


33 










[7 13 


| Ammonium 


3 


I 1-5 


r 


34 


User Defined! 3 




I 0 




|7 14 


I Aluminium 


3 




0.1 


r 


35 


, . -,; • .' 








|7 15 


Copper 


3 




1 


r 


36 


UserDefrned15 








|7 16 


Zinc 


3 


1 3 


r 


37 


UserDefined16 








|7 17 


| Mercury 




0.006 


r 


38 


U«erDehned17 








17 18 


| Cadmium 




0.003 


r 


39 


UserDef,ned18 








[7 19 


| Chromium 




0.05 


r 


40 


Use,Def.ned19 


I 0 






|7 20 


Lead 




0.01 












[7 21 


| Arsenic 




0.01 




































Save 


Clear All 


I 







Figure 4 Facilities of IWQIS provided for the user to define the criteria parameter, weights, and limit values. 
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pH Values 
Figure 5 Spatial variability for pH. 



structure of the index. In this software, 21 parameters 
are set as default which can be selected by user. There 
are also 19 user-defined parameters which can be set as 
criteria parameters based on the availability of data and 
experts' professional judgements. After defining the pa- 
rameters, weights, and limit values, the user can add the 
data to the created database and generate an output 
report. 

The report of IWQIS is generated as an Excel work- 
book with three worksheets, "Original Data", "Quality 
Values", and "Water Quality Index". The first sheet, 
Original Data, includes the data which were previ- 
ously entered in the database. The second sheet pre- 
sents the calculated quality values for each parameter 
and the third sheet includes WQI and the related 
interpretations. 



Spatial variability and principal component analysis 

Recently, multivariate statistical methods have been used 
to characterize and evaluate surface and groundwater. 
Chemical, biological, and physical data were monitored 
at 12 locations along the Passaic River, New Jersey and 
analysed in a study performed by Bengraine and Marhaba 
[24]. PCA was used to extract the factors related to the 
hydrochemical variability and to demonstrate the spatial 
and temporal changes in water quality. Singh et al. [25] 
used cluster analysis (CA), factor analysis (FA), PCA, and 
discriminant analysis (DA) of the dataset on water quality 
of the Gomti River (India). They concluded that 10 pa- 
rameters (river discharge, pH, BOD, CI, F, P0 4 , NH 4 -N, 
N0 3 "N, TKN, and Zn) contributed to 97% correct 
assignations in the spatial analysis of three different re- 
gions in the basin. Zhou et al. [26] showed that multi- 
variate statistical methods are useful for interpreting 
complex data sets in the analysis of temporal and 
spatial variations in water quality and could be used 
for the optimization of a regional water quality moni- 
toring network. 

In this study, the spatial variability in the dataset with 
735 drinking water samples in the country was illus- 
trated using box plots. After filtering records with miss- 
ing values, PCA was performed to find the meaningful 
components. The retained components were used to 
perform a linear model. Finally, the fitness of predictions 
of the principal component model generated and the 
WQI computed by IWQIS was determined. It should be 
noted that PCA was performed using R software [27]. 

Results and discussion 

Stambuk-Giljanovic [1] believes that lack of consent for 
the selection of quality evaluation parameters is the 
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Figure 6 Spatial variability for calcium, magnesium, chloride, TDS, nitrate, sulfate, and sodium. 
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greatest obstacle to a broader index application in the 
world. Rickwood and Carr [2] published a list of all pos- 
sible parameters, their associated WHO guidelines, and 
whether they were measured in 20%, 35%, and 50% of 
countries in all regions: Europe, Asia, Africa, Americas, 
and Oceania. The appropriate selection of criteria vari- 
ables from the list for setting the quality index is still the 
most important task. In this study, the selection of vari- 
ables was essentially based on the availability of data on 
the national scale. We tried to choose those parameters 
which are commonly measured in water monitoring 
programmes. In this stage, the objective of our study was 
to show a general picture of drinking water quality using 
widely selected water samples from around the country. 

Figures 5, 6 and 7 show the spatial variability of drink- 
ing water quality parameters in the database. 

Multivariate analysis carried out by means of PCA 
resulted in four components, which accounted for 74% 
of the spatial changes. Table 3 presents standardized 
loadings based upon the correlation matrix in the ap- 
plied PCA. As shown in Table 3, the components having 



Table 3 Standardized loadings (pattern matrix) based 
upon correlation matrix 



p 


PCI 


PC2 


PC3 


PC4 


SS loadings 


6.09 


1.76 


1.34 


1.19 


Proportion Var 


0.44 


0.13 


0.1 


0.08 


Cumulative Var 


0.44 


0.56 


0.66 


0.74 


Proportion explained 


0.59 


0.17 


0.13 


0.11 


Cumulative proportion 


0.59 


0.76 


0.89 


1 



SS loadings or eigenvalues greater that one were 
retained. The first component accounts for about 44% of 
the variance, the second component for about 13%, and 
the third and fourth for about 10% and 8%, respectively. 

Table 4 presents the loading of each variable under 
each of the four components. The first principal compo- 
nent represents the most important process or mixed 
process controlling the hydrochemistry, which has the 
highest eigenvalue and accounts for the highest variance 
in the component matrix. 

In this study, the first component, which accounts for 
about 44% of the variance, has high positive loadings 
for magnesium, chloride, TDS, fluoride, and sulfate, and 
could be due to the dominant share of groundwater 
resources in supplying drinking waters. The second 
principal component accounts for about 13% of the 
hydrochemistry and has a high positive loading for cal- 
cium. This factor could be related to higher alkalinity 
of groundwater due to bicarbonate ions. The third prin- 
cipal component accounts for about 10% of the vari- 
ance in the hydro-chemical data and has high positive 
loadings for NO3 and turbidity. This could be attributed 
to the impact of domestic waste and agricultural activ- 
ities. The fourth principal component represents about 
8% of the variance in the hydrochemistry of drinking 
water in the country and has high positive loadings for 
ammonium, which is an indication of agricultural prac- 
tice with excessive use of fertilizers. 

Using the factor scores, a linear regression model was 
developed to investigate the fitness of four principal com- 
ponents and the WQI which was computed by IWQIS. 
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Table 4 Component matrix 



Parameters 


PCI 


PC2 


PC3 


PC4 


h2 


u2 


pH 


0.38 


-0.58 


-0.15 


0.16 


0.53 


0.469 


Calcium 


0.29 


0.77 


0.06 


-0.12 


0.61 


0.389 


Magnesium 


0.76 


0.17 


0.11 


-0.11 


0.64 


0.364 


Chloride 


0.96 


-0.07 


-0.03 


0.01 


0.93 


0.075 


TDS 


0.98 


-0.06 


-0.02 


0.01 


0.96 


0.043 


Fluoride 


0.90 


-0.07 


-0.04 


0.01 


0.81 


0.19 


Manganese 


0.12 


0.73 


-0.3 


0.12 


0.66 


0.342 


Nitrate 


-0.17 


0.42 


0.69 


0.02 


0.68 


0.323 


Iron 


-0.24 


0.17 


-0.35 


0.01 


0.21 


0.79 


Sulfate 


0.89 


0.31 


-0.06 


0.01 


0.89 


0.107 


Ammonium 


0.04 


-0.08 


0 


0.98 


0.96 


0.035 


Sodium 


0.95 


-0.06 


-0.09 


0.05 


0.92 


0.083 


Turbidity 


-0.06 


-0.1 


0.76 


0 


0.6 


0.404 



The outputs of the linear regression model for the four 
retained principal components are presented in Table 5. 
The low p-value (< 0.05) indicates the significance of the 
model. The high value of multiple R-squared (0.9883) 
shows the strong correlation between WQI values and 
predictions from the principal components model. 

Table 6 summarizes the descriptive statistics of the 
WQI which was computed for 735 drinking water sam- 
ples in the study. Figure 8 illustrates the share of each 
water quality classification presented by Sharma and 
Patel [12] based on WQI values. 

Conclusion 

The previous works done by researchers had revealed 
that water quality indices should be set according to 
generic water quality parameters as well as locally im- 
portant variables which may not be of importance in 
other locations. The results of these researches showed 
that the WQI for the monitoring of water quality 
changes with time and location. Hence, the importance 
of the variables, availability of the data, and experts' pro- 
fessional judgements should be considered as the main 



Table 5 Outputs for linear model of 4 retained principal 
components 



Coefficients 


Estimate 


Std. error 


t 


value Pr(>|t|) 


(Intercept) 


29.5616 


0.1157 


255.40 


<2e-16*** 


scores$PC1 


18.6690 


0.1159 


161.08 


<2e-16*** 


scores$PC2 


2.1963 


0.1159 


18.95 


<2e-16*** 


scores$PC3 


2.2290 


0.1159 


19.23 


<2e-16*** 


scores$PC4 


8.3036 


0.1159 


71.64 


<2e-16*** 


Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 " 1. 
Residual standard error: 2.259 on 376 degrees of freedom. 



Multiple R-squared: 0.9883, Adjusted R-squared: 0.9882. 
F-statistic: 7952 on 4 and 376 DF, p-value: < 2.2e-16. 



Table 6 Descriptive statistics of water quality index of 



drinking water samples 



Mean 


29.64 


Quantile% 


WQI 


Standard Error 


0.77 


10 


12.36 


Median 


23.16 


25 


16.685 


Standard Deviation 


20.75 


50 


23.16 


Sample Variance 


430.67 


75 


36.5 


Range 


155.85 


90 


54.416 


Minimum 


6.23 


95 


71.309 


Maximum 


162.08 






Sum 


21787.77 






Confidence Level (95.0%) 


1.50 







cornerstones of WQI development. In this study, the 
Iranian Water Quality Index Software (IWQIS) has been 
set, tested and proved to be an efficient tool to facilitate 
the setting up of water quality indices based on flexible 
use of variables and existing water quality databases. 
The software prepared in this work will help researchers 
and water quality monitoring experts to design and calcu- 
late their own water quality indices easily. The presented 
software can be used by other researchers and communi- 
ties based on the following considerations. 

• The criteria parameters, weights, and limit values 
should be entered into the program according to 
local considerations. 



( ^ 
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Figure 8 The share of each water quality classification based 
on WQI values. 
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• If the data are previously available, IWQIS would be 
a helpful tool to calculate the desired WQI, 
especially if there are some missing values in the 
record set. 

• In cases where samples with many parameters have 
been collected, techniques such as PCA are useful to 
reduce the number of variables. 

• IWQIS can also be used to determine the sensitivity 
analysis of weights attributed to the parameters 
when the allocation of definite weight factors to 
some parameters is controversial. 
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